AIbase
Home
AI Tools
AI Models
MCP
AI NEWS
EN
Model Selection
Tags
Arbitrary Resolution Visual Tokenization

# Arbitrary Resolution Visual Tokenization

VL3 SigLIP NaViT
Apache-2.0
The visual encoder for VideoLLaMA3, utilizing Arbitrary Resolution Visual Tokenization (AVT) technology to dynamically process images and videos of different resolutions.
Text-to-Image Transformers English
V
DAMO-NLP-SG
25.55k
8
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
English简体中文繁體中文にほんご
© 2025AIbase